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ABSTRACT 

This is the first in a series of papers that analyze college student beliefs in realms where common astronomy 
misconceptions are prevalent. Data was collected through administration of an inventory distributed at 
the end of an introductory college astronomy course. In this paper, we present the basic mathematics of 
item response theory (IRT), and then we use it to explore concepts related to galaxies. We show how IRT 
determines the difficulty of each galaxy topic under consideration. We find that the concept of galaxy spatial 
distribution presents the greatest challenge to students of all the galaxy topics. We also find and present the 
most logical sequence to teach galaxy topics as a function of the audience's age. 

1 INTRODUCTION 

Galaxies, a concept presented to most American students in their K-12 years, are the source of a 
variety of misconceptions (deeply held beliefs that are inconsistent with currently accepted scientific concepts, 
Comins 2001). Motivated by an interest in understanding when these, and myriad other misconceptions 
about astronomy, are replaced with scientifically correct knowledge, one of us (NFC) developed an inventory 
to explore this issue. Presented to college students for extra credit upon completion of an introductory 
astronomy course, the inventory has provided us with interesting insights into the process of unlearning 
galaxy-related and other misconceptions. A total of 403 students over four semesters availed themselves of 
the opportunity to participate in the inventory. The analysis of that data led us to postulate different orders 
for teaching this material as a function of grade level in K-12 and college. 
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The effect of misconceptions on understanding of astronomy concepts has been analyzed in a number 
of studies (e.g. Vosniadou, Vamvakoussi, & Skopeliti 2008, Bailey et al. 2009, Sadler et al. 2010, Wallace, 
Prather, & Duncan 2011). Because stars, the building blocks of galaxies, are an important topic in astronomy 
as gauged by textbook coverage, course curricula, and the National Science Education Standards for middle 
school, high school, and college students, Bailey et al. (2009) assessed preinstructional ideas about stars and 
star formation held by 2,200 non-science majors taking an introductory astronomy course (preinstructional 
meaning prior to starting the course). They cite "the constructivist movement of the late 20th century," 
which suggests that effective instruction requires the guidance of preinstructional student ideas, and further 
emphasize from Bransford, Brown, and Cocking (1999) that student understanding of astronomical concepts 
after instruction strongly depends on whether teachers ignore or teach directly to student preinstructional 
ideas. Bailey & Slater (2003) note that the constructivist movement is a "student model-building paradigm" 
founded on how students synthesize information and experiences into conceptual models. Motivated by the 
constructivist pedagogy, Bailey et al. (2009) observed from their study that students often bring misin- 
formation to the classroom, which is then often combined with accurate information learned in the course 
into a "synthetic model" (Vosniadou, Vamvakoussi, & Skopeliti 2008), thus leading to an often incorrect 
understanding of stars and their formation. 

Sadler et al. (2010) conducted a study of 7599 students and their 88 teachers spanning grades 5-12 
on a variety of astronomy topics. The researchers refer to a number of educational standards, notably the 
NRC Standards and AAAS Benchmarks, in assessing the required knowledge for "astronomical literacy." 
Based on these standards and misinformation held by students, the researchers designed a series of multiple- 
choice tests with wrong answers citing misinformation held by students. The study showed that teachers 
"dramatically overestimate their students' performance" in that teachers fail to recognize the extent of their 
students' misconceptions. Their findings suggest that teachers need to be made more aware of their students' 
misconceptions in teaching astronomy topics. 

The classroom is not the only source of information on astronomy, of course, and students may ac- 
quire accurate and inaccurate information from life experience, their own reasoning, and media presentations 
(Comins 2001, Libarkin et al. 2011). Even with highly competent instruction in the classroom, students do 
not necessarily acquire accurate scientific information in a straightforward linear progression; there are often 
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"apparent regressions" (National Research Council 2007, p. 95) that may even be necessary for eventual 
understanding. Research on patterns of students' endorsement of accurate and inaccurate statements about 
astronomy at various stages in their educational careers is important to the design and refinement of effective 
instruction (Wallace, Prather, & Duncan, 2011). 

Evidently, careful guided instruction directed specifically at both student misinformation and inter- 
pretation of data is essential in ridding oneself of misinformation. Yet none of these studies have attempted 
to quantify the difficulty of correctly learning each astronomy concept. The studies use common misinfor- 
mation to design tests to score children, adolescents, and college students, but the tests do not consider how 
hard it is for the participants to realize that their misinformation is in fact wrong. We were thus led to 
propose the following research questions: 

1. Which galaxy misconceptions are the most persistent for college students? 

2. What can we learn about the best order to teach the associated topics? 

In the present paper, we explore both of these issues. By determining the most persistent misconceptions, 
we can suggest topics requiring care in presentation and, in all likelihood, more time in the classroom. This 
work also suggests the most productive order in which to present material concerning galaxies. We are 
presently testing the effectiveness of this order compared to the order presented in textbooks. 

This paper is organized as follows. In Section 2, we discuss the methodology of our study, including 
aspects of our inventory design, administration, and collection of data. Section 3 introduces the mathematics 
of item response theory, which we use to "score" the difficulty of the misconceptions. Section 4 shows how 
we use factor analysis to group the galaxy items presented in this paper into 3 subsets. Section 5 presents 
the results of our analysis using item response theory, and we conclude with our findings in Section 6. 

2 METHODOLOGY 

The rationale for this study is to demonstrate how our inventory of astronomy misconceptions quan- 
tifies the difficulty of the concepts independent of teaching pedagogy, explicit demonstration of scientific 
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understanding, or conceptual change frameworks. Rather than score student performance on multiple-choice 
tests or diagram interpretations, college students are asked to respond to a series of astronomy miscon- 
ceptions, in the form of statements. Because we did not want to tell students that the statements are all 
misconceptions, we referred to the statements in this inventory as "beliefs. " The students are asked to in- 
dicate when, in their lives, they were led to believe that each statement is actually wrong, or if they still 
perceive it to be true. Because our study does not use the constructivist pedagogy discussed in the afore- 
mentioned studies, our inventory has the distinct advantages of (1) being completely independent of teaching 
technique, (2) pinpointing times in the students' lives as to when students unlearned misinformation, and (3) 
extracting the most challenging topics associated with highest retention of misconceptions. Our study thus 
clarifies the most, vs least, challenging concepts to learn so that instructors can allocate their time logically 
by teaching directly to the most challenging misconceptions. 

Our study analyzes data from a comprehensive inventory of astronomy misconceptions administered 
to students by NFC, professor of physics and astronomy, and instructor of the introductory astronomy lecture 
course at the University of Maine. The AST 109 Misconceptions Inventory (AMI, see Appendix [A] for a list 
of acronyms used in this study) was administered at the end of the semester on a voluntary basis to students 
enrolled in the course. In the AMI, students filled in a scantron to respond with 6 options as indicated in 
Table [T] reflecting when they learned to reject a misconception or whether they still retained it even after 
taking AST 109. All the students in our study were of college age. 

Table 1: The 6 response options for the AMI 

A if you believed it only as a child 

B if you believed it through high school 

C if you believe it now 

D if you believed it, but learned otherwise in AST 109 

E if you never thought about it before, but it sounds plausible or correct to you 

F if you never thought about it before, but think it is wrong now 

The purpose of the inventory is to determine when students disabused themselves of various astron- 
omy misconceptions, or if they still perceive them as true. The statements in the inventory used in this paper 
were based on misconceptions identified by NFC between 1985 and 1995 (Comins 2001). All the participants 
in our study were of college age. Students were also encouraged to write a one-line comment to correct any 
wrong-sounding statement. These comments helped us determine whether the student beliefs are currently 
correct. The inventory, including a code for each statement in order of appearance, is provided in Appendix 
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[B] Table [2] lists the semester, class enrollment, number of students participating in the AMI, and number 
of items in each administration of it. The textbook used in each semester was the most current edition of 
Discovering the Universe by Comins and Kaufmann. 

Table 2: Semester, class, enrollment for the AMI 

Semester Class Size N in Study AMI Items 



Fall 2009 188 118 267 

Fall 2010 175 105 235 

Fall 2011 171 91 235 

Fall 2012 170 93 235 



The AMI covers such a broad range of topics in astronomy that a complete statistical survey on 
all AMI data requires us to submits several papers. As such, this paper is the first in a series that will 
cover the entire inventory. The section on galaxies was chosen for this paper because galaxies constitute 
very distant objects that we generally cannot observe with the naked eye. Their properties are thus studied 
and published most typically by professional astronomers, rather than hypothesized by the general public 
through their personal experiences or observations. As a result of NFC's teaching since 1985, 12 common 
galaxy misconceptions were identified, employed in the AMI, and analyzed in this paper. The statements 
were coded in order of presentation in the inventory, with galaxies coming near the end of the inventory and 
the prefix "sA" for "statement revision A." The statements are presented in Table [3j Galaxy statements 
sA223 and sA229 (see Appendix [B]) were omitted from the study. 

Table 3: Galaxy statements in the AMI 

sA218: the Milky Way is the only galaxy 

sA219: the solar system is not in the Milky Way (or any other) galaxy 

sA220: all galaxies are spiral 

sA221: the Milky Way is the center of the universe 

sA222: the Sun is at the center of the Milky Way galaxy 

sA224: the Sun is at the center of the universe 

sA225: there arc only a few galaxies 

sA226: the galaxies are randomly distributed 

sA227: we see all the stars that are in the Milky Way 

sA228: all galaxies are the same in size and shape 

sA230: the Milky Way is just stars — no gas and dust 

sA231: new planets and stars don't form today 



The general theme of this research is to identify the most persistent astronomical misconceptions. 
Thus, the focus of this paper is the recoding of student response to measure misconception persistence on a 
representative scale of numeric graded response options. Out of the original 6 options (Table [lj, we recoded 
student responses in the manner described in Table [4] Lower scores indicate younger ages at which students 
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dispelled misconceptions. Higher scores indicate increased misconception persistence. For example, a score 
of 3 indicates a misconception that still persists with the student. 

Table 4: Recoded student responses for the AMI 

Response Score Interpretation of the score 

A or B 1 The student dispelled the misconception during childhood or adolescence 

D or F 2 As a result of taking AST 109, the student learned to reject the misconception 

C or E 3 The student finds the misconception true even after taking AST 109 



3 ITEM RESPONSE THEORY 

3.1 Comparison of IRT vs. CTT 

As noted by Wallace & Bailey (2010) and Morizot, Ainsworth, & Rcisc (2007), the common practice 
in psychometric analysis is to use classical test theory (CTT), which compares the difference in the observed 
vs. true participant score, or the observed score variation vs. the true score variation. The reliability 
of CTT depends on parameters that are strongly influenced by the sample. Our alternative is to use item 
response theory (IRT) methodology, in which item difficulty is established independent of participant abilities. 
IRT is also capable of clarifying the extent of discrimination between two participant groups, that is, "to 
differentiate between individuals at different trait levels" (Morizot, Ainsworth, & Reise 2007). As Kline 
(2005) and Funk & Rogge (2007) note, given a large sample size for a group of well-correlated items, the 
standard error of the mean per item converges more rapidly in IRT than in CTT, indicating that IRT is 
generally more precise than CTT. We have thus decided to use IRT for its improved reliability over CTT to 
analyze student responses to the AMI. 

Item response theory (IRT) is a statistical theory that distinguishes the latent trait (designated 
"ability") of a participant from the difficulty of a set of items with well-correlated response patterns. IRT 
methodology assumes unidimensionality among the items in the set of items, that is, a group of items are 
assumed to have well-correlated response patterns such that the difficulty of each item can be reasonably 
compared. As noted by McDonald (1999) and Morizot, Ainsworth, & Reise (2007), unidimensionality is the 
property that the items in the set are locally independent, meaning, they vary only in difficulty and each 
probe essentially the same concept. They further note that the items themselves should have enough common 
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variance to give reasonably unbiased estimates of item difficulty. They state that in general, an unbiased 
analysis for dichotomously-scored items (those with two possible response codes, e.g., or I) may have as 
few as 100 participants, whereas 5-point response formats require a sample size of at least 500 participants. 
In the AMI, we performed IRT analysis with 3 possible scores for the 403 respondents, so we can reasonably 
assume that our IRT analysis will not be subject to low sample-size bias. 

3.2 The 1 and 2 parameter logistic models 

Student responses are tabulated based on the frequency of the scores in Table |4j allowing us to 
construct a logistic curve model for the probability distribution of the AMI scores (Wallace & Bailey 2010, 
Morizot, Ainsworth, & Reise 2007, McDonald 1999, and references therein). Logistic curves indicate the 
probability of obtaining at least a certain score in respondents with different trait levels. The idea is that 
the logistic curves are related to an individual item, so they are often called item characteristic curves. Since 
part of our analysis involves polytomous scoring (with more than two possible responses per item) , we first 
describe item characteristic curves for dichotomous scoring, then generalize to polytomous scoring using the 
graded response model. 

In the one-parameter logistic (1PL) model, or Rasch model, the probability for participant p whose 
ability level is 8 to obtain a score X p j (typically taken to be 1 for a correct answer and for an incorrect 
answer) for a dichotomously scored item j of difficulty bj is given by 

P(X pj = l\6,b j ) = (l + e-V-W)- 1 . (1) 

The plot of P(X p j — l\6,bj) vs. 8 is called the item characteristic curve (ICC). The ICC in Figure [TJhas 
the parameter bj = 0.8, such that at the ability level 8 — 0.8, participant p has a 50% chance of responding 
with the correct answer. This axis is scaled on a z-score metric such that its mean and standard deviation 
are respectively and 1 (Morizot, Ainsworth, & Reise 2007). 

We should note that term "ability" in IRT literature is somewhat arbitrary (Morizot, Ainsworth, 
& Reise 2007). The 1PL is used to determine the probability of responding with the correct answer, as a 
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P(X pj =1 \6,bj=0.8) 




Figure 1: Graph of an ICC with the parameter bj = 0.8, where P(X p j = l\8, bj = 0.8) is the probability of responding with the 
correct answer, and 9 is the ability of the participant, so that the ability axis is the horizontal axis. Participants with a higher 
ability level than bj have more than a 50% chance of responding with the correct answer 

function of some latent trait of the participants. An example of a latent trait used by Thorpe et al. (2007) 
in their study of common irrational beliefs is to graph the 1PL as a function of the participants' irrationality, 
in which case "irrationality" would be the label for the 9 axis. For our study, we will refer to the ability (9) 
axis as a scale of misconception retention throughout the paper. 

In the 1PL model, the shape of the probability distribution is the same for each item in a group. 
That is to say, the spread of the probability of right vs. wrong answers is assumed to be the same for each 
item. For our study, we note that not all the galaxy items probe the same specific concept, so some items 
will more sharply discriminate higher and lower-achieving participants than other items. In Figure [2j the 
steepness, or discrimination, at bj — 0.8 of the second ICC (in red) is 3 times higher than that of the previous 
item (in blue). 

In order to incorporate changes in the probability distribution shape among the grouped items, we 
use a two parameter logistic (2PL) model, as suggested by Wallace & Bailey (2010), with a discrimination 
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P(X pj =1 \ 6,bj=0.8) 




Figure 2: Graph of two ICCs, each with the parameter bj = 0.8. The red ICC discriminates between higher and lower-ability 
participants 3 times more than the blue ICC 

parameter cij that measures the steepness of the ICC. The 2PL model is written as 

P(X„ = = (l + e-'t*- 6 '))- 1 . (2) 

A high value for aj corresponds to high item discrimination, which indicates a strong division between 
higher and lower-achieving participants for the item. Values of a,j — ► correspond to a broader mixing of 
participant misconception persistence levels. There further exists a 3-parameter logistic (3PL) model, which 
incorporates a guessing parameter Cj for right or wrong answers. The 3PL model is not applicable to our 
study, because the AMI is not a multiple- choice test, so Cj = 0. 

3.3 The graded response model 

While the 1PL and 2PL models are used in IRT for dichotomous scoring, a polytomous model allows 
for more than 2 responses. Hence, there will be more than one "characteristic curve" to plot the probability 
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of responding with a particular score. The more proper term to use to describe each curve, as noted by 
Reeve and Fayers (2005), is category response curve (CRC). We will thus hereafter refer to the curves as 
CRCs. 

Statistical analysis in the AMI involves polytomous scoring. To perform item response theory 
methodology on polytomous scoring, one may choose from a number of models (e.g. Edelen & Reeve 2007). 
Of these models, the graded response model (GRM), first introduced by Samejima (1969), is most relevant 
to us, since we intend to secure the order of student responses throughout our analysis. In the GRM, the 
total probability of any response is normalized to 1, that is, 

K 
k=l 

for K scores. One can thus plot all of the CRCs on the same graph, as in Figure [3| 




Figure 3: Graph of five 2PL CRCs according to Samejima's graded response model, with the second through fifth CRCs 
(indicated in purple, blue, green, and yellow) having the parameters bjk = —2.0,0.0,1.5,2.5. In Samejima's graded response 
model, the parameter of the first CRC is determined by the other CRCs in the plot 



Because of probability conservation, K — 1 CRCs have a defined parameter, and we follow the 
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convention introduced by Samejima to chose the locations of all but the first CRCs. Hence, in Figure |3j 
the second through fifth CRCs, indicated by purple, blue, green, and yellow, have the respective parameters 
bjk = —2.0,0.0,1.5,2.5, whereas the first CRC, in red, has no parameter. We specifically note that instead 
of just bj, we now require that b have two subscripts, one for item j, and one for the particular score k for 
K — 1 unique parameters. 

Following the construction by Samejima (1969), in which k goes from 1 to K, the 2PL function for 
the highest score, PjK, is given by 

P jK (d) = (1 + e-^-^))- 1 . (4) 

The 2PL for the K - 1 th score is then 

Pjjc-xW) - (1 + e -M0-^--i))-i _ PjK (e). (5) 
To conserve probability, the 2PL for the K — 2th score is then 

Pj,K-2(e) - (1 + e -«^-^-0)-i - P^ K _ X {6) - P 3K (6). (6) 
This pattern continues until we get to the lowest score, which is 

P jl (6) = l-P j2 (9)-...-P jK (6). (7) 

Samejima introduces the notation 

where Pji-(@) * s the sum of all CRCs from Pj.k+i up to PjK- Comprehensively the probability of responding 
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with score k in the graded response model for each item j is 



Pjk(0) = { 



1 

(l + e -a 3 -(S-6^))-l -Pf k {9) 

(1 + e _Q j( 6 '- f >jA-))-i 



for k = 1, 

for 2 < fc < X - 1, 

for k = K. 



(9) 



3.4 Item information 



In Fisher information theory, the item information is statistically the variance of the score (Lehmann 
& Casella 1998); more conceptually, that information is a relative measure of how reliable the value of an 
CRC is at 6, or how well each score is being estimated at 8. Hence, in IRT literature, one typically refers to an 
item as being "most informative" (or "best estimating" each score) where the item information curve peaks. 
Away from these peaks, where 1(9) is lower, the scores are not estimated very well, and so the reliability 
of the CRC values for the score decreases with information (Baker 2001). We thus seek a relationship 
between the CRCs and the information curves for a group of items. Extending Fisher information theory 
to polytomously-scored items, the item information curve for item j is given (Chajewski & Lewis 2009) 

by 




Figure [51 shows the total information 1(9) for the example in Figure ^ 



Local information maxima are obtained by finding the abilities 9 S that satisfy 




where 9 S marks the misconception persistence location of a transition from one score to another. A quick 
approximation for 9 S between two neighboring CRCs with scores k and k + 1 can be found by estimating the 
coordinates of the information peaks. Our estimation assumes that the contribution to 1(9) from each of the 
other CRCs is negligible, in other words, for another score n, Pj n ~ 0. Our approximation thus applies to 
both the red-purple and purple-blue CRC intersections of Figure [3] which represent the first two information 
peaks in Figure [4] 
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K0) 



/ \ r° 




/ \ / 1,5 




/ \ / 10 




/ 0.5 





2-10 1 2 

Figure 4: Graph of the total information for the five CRCs in Figure [3] 



If, in Eqn. (11 1, we assume that two neighboring CRCs overlap with Pj n w 0, then 



1 (dP jk {6) 



d9 \P jk {0) V do 



1 fdP jtk+1 (9) 



d6 \Pj,k+i(0) \ d6 



After taking the derivative and setting 9 = S , one has, on the left-hand side, 



dP jk (6 s ) ( 2 d 2 P 3k {9 s ) 1 fdP jk (6, 



d(L \P jk (9 s ) d9l Pf k (0 s )\ dH„ 

and, on the right-hand side, 

dP jtk+1 (9 s ) ( 2 d 2 P^ k+1 (0 s ) 1 (dP jtk+1 {9 s ) 



d9 s \P jjk+1 (0 s ) d9 2 s Plk+M V d0 a 



The only way for the left and right sides of the equation to be equal is if 

dP jk {6 s ) dP jtk+1 (9 s 



P 3 k(9s) = Pj, k+ i{9 s ), and 







(12) 



d6 s 



(13) 
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But the identification Pjk{8 s ) = Pj,k+i(9 s ) is also where two CRCs intersect. Therefore, if the contribution 
to 1(9) from each of the other CRCs is negligible, then maximum information is obtained at the intersection 
of CRCs. 

Note that for polytomous scoring, up to K — 1 peaks at the solutions 9 S may be clearly identified from 
the shape of 1(9). When the parameters bjk for two neighboring CRCs match more closely, their information 
peaks can either form a plateau or converge into a single peak in the plot of 1(9) vs. 9. For example, if, 
instead of our example parameters being bjk = —2.0,0.0,1.5,2.5, we had bjk = —2.0,0.0,1.5,2.3, the total 
information curve would look like that in Figure [5] in which the right-most information peaks appear to 
merge into a plateau, whereas the first two peaks corresponding to bjk = —2.0, 0.0 are not affected. Or, if 
we had bjk = —2.0, 0.0, 1.5, 1.9, the two right-most information peaks would merge into one, as in Figure |6j 
while, again, the first two peaks would remain unaffected. 

1(0) 

2.5- 

/ \ r ' 

/ \ / 1.5- 

/ \ / 10 

/ 0.5- 




, . ■ 1 ■ ■ e 

-3-2-10 1 2 3 

Figure 5: Graph of the total information for a variation of the five CRCs in Figure [3] except with bjk = —2.0, 0.0, 1.5, 2.3 

The significance of the "plateau" and merged peaks is as follows: near the ability level 9 where this 
occurs, there exist two almost-simultaneous transitions between neighboring scores, i.e., from k — 1 — > k 
and fc — > k + 1. This suggests that around this 9, there exists a net transition from k — 1 — > k + 1, where 
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/ \ r 




/ \ / 1 R 
/ \ / 1 .0 




/ \ / 1,0 




/ 0.5 









-3-2-10 1 2 3 

Figure 6: Graph of the total information for a second variation of the five CRCs in Figure[3] except with bj^ = —2.0, 0.0, 1.5, 1.9 

participants are not very likely to respond with score k. As we illustrate later in this paper, such a merger 
suggests overlapping responses between two audience groups at that ability level, whereas for most of the 
items in our IRT analysis, we find little or no such overlapping responses. 
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4 PRINCIPAL COMPONENTS ANALYSIS 

The strength of item inter-correlations is determined via factor analysis (e.g. Lee & Ashton 2007, 
and references therein) . We chose the method of principal components analysis (PCA) , which reduces the 
number of AMI galaxy items to a set of factors. In PCA, each factor is aligned along a principal component 
(analogous to an axis in a coordinate system) with the first factor accounting for the most variance among 
the statements, the second factor accounting for the second most variance among the statements, and so 
forth. Orthogonal components are altogether unrelated to each other (Lee & Ashton 2007) . Varimax rotation 
is orthogonal rotation of the principal components to maximize the sum of the variances of the factors along 
each component (Kline 2005; Lee & Ashton 2007). Varimax rotation is the most preferred rotation for 
researchers interested in interpreting the factors and is also the rotation method that we use for the analysis 
presented in this paper. 

In considering correlations of when students dispelled or retained galaxy misconceptions, PCA de- 
termines the factors of statements that we consider "unidimensional enough" to perform IRT analysis, as 
discussed above. We performed PCA using the software suite SPSS, with varimax rotation on student re- 
sponses to galaxy misconceptions for all semesters to ensure maximum variance in the factors. Our results 
are presented in Figure [7] and are discussed here. The procedure for extracting factors is as follows: in PCA, 
a scree test (Cattell 1966) gives the relative variance of scores for up to as many factors as there are items, 
with the average variance for all scaled to 1 unit. The scaled variances are called eigenvalues A, and A = 1 
represents the mean variance of all of the items (galaxy statements). Kaiser (1960) suggests that the number 
of reasonable factors to extract is determined by how many factors have A > 1. Cattell suggests that one 
should assign a cutoff above the largest eigenvalue difference between two successive factors other than the 



first factor. Using either method, we are inclined from the scree plot in Figure 7(a) to extract three factors 
from our original 12 galaxy items. 

Once the components are rotated, loadings (or correlations) on each component can be determined 
for all items. The correlation of responses between two items a and b can be checked by calculating the 
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Scree Plot of Galaxy Statements from Fall 2009 to Fall 2012 




Rotated Component Matrix 
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Extraction Method: Principal Component 
Analysis. 

Rotation Method: Varimax with Kaiser 
Normalization. 

a. Rotation converged in 5 iterations. 

(b) 



Figure 7: Principal components analysis for the combined Fall 2009, Fall 2010, Fall 2011, and Fall 2012 timeline scores for 
galaxy misconceptions: (a) Scree plot, (b) components 



sample correlation coefficient C, given by 



C = 



J2 {ai - a)(bi - b) 

i=l 

Hv N 



(14) 



The correlation coefficients in Figure [7(b)] show the loading of each statement onto the respective component, 
with the factors highlighted in orange and other loadings of at least 0.3 highlighted in faint yellow. These 
lesser loadings indicate weaker component loadings, which indicates a less significant correlation of student 
responses to those in the factors. For example, the seven statement factor loads strongly on the first 
component, while student responses to statement sA227 correlate less strongly with those in the first factor. 
Similarly, the three statements in the second factor, related to the concept of "centralization" (see Table 
[3]), have well-correlated responses, and statement sA225 has responses that correlate somewhat with both 
factors 1 and 2. It is thus not surprising to see that these statements have well-correlated misconception 



scores. 



PCA says that the correlated responses to the galaxy items can be grouped into these smaller 
factors. The first factor consists of seven statements regarding primarily visual properties of galaxies, such 
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as their shapes, sizes, count, and internal structure with regard to the presence of gas, dust, and planet 
formation. The second factor consists of three statements regarding the idea of "centralization," that is, 
the misconceptions that we are at the galactic center, or the Milky Way is the center of the universe. The 
remaining two statements, in the final factor, probe student understanding of distributions, specifically of 
stars within galaxies and of the galaxies themselves throughout the universe. Our PCA results suggest that 
instructors should group galaxy concepts into these three factors. 

Part of our study is to look at inter-correlations of student responses. We note that from Figure 
sA218, the misconception that there is only one galaxy in the universe, has a loading of 0.431 on 
the second factor of galaxy statements, pertaining to centralization. Additionally, statement sA225, the 
misconception that there are only a few galaxies, has a 0.368 loading on this factor. Hence students who 
endorse misconceptions pertaining to centralization are also likely to say that there is only one or a few 
galaxies in the universe. 

To our surprise, sA227, the misconception that we see all the stars that are in the Milky Way, is 
best correlated with sA226, the misconception that the galaxies are randomly distributed, than with any 
particular statement in Factor 1, most of which relate to visual properties of galaxies. This suggests that 
when asked whether or not we can see all the stars in the Milky Way, students more often consider how 
the stars are distributed, rather than how bright or dim they are. Hence, sA226 and sA227 probe one's 
conceptual framework of "spatial distribution." 

5 IRT ANALYSIS ON THE GALAXY STATEMENT FACTORS 

IRT methodology, as discussed in Section [3j determines how the inclusive factor statements should 
be sequenced. We applied IRT methodology to our recoded scores (Table [4| using MULTILOG (DuToit 
2003), which generates, for each item, a plot of CRCs and the item information curve analogous to Figures 
[3] and [4] respectively. In each of these graphs, the left-side graphs in each Figure include item characteristic 
curves for the response codes (curves labeled by scores 1, 2, or 3) as a function of the student's ability 
to disabuse themselves of the misconceptions. To illustrate how to read the graphs, consider the left side 
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of Figure [HJ containing the CRCs for statement sA218, the misconception that the Milky Way is the only 
galaxy, for various levels of misconception endorsement (the horizontal (6) axis). The right-side graph in 
Figure [8] shows how informative the item is for various levels of misconception endorsement. As discussed at 
the end of section |3.4| the location on the horizontal axis corresponding to the information peak represent 
transitions between levels of misconception retainment (i.e., retention after AST 109 vs. disambiguation in 
AST 109). 




Misconception Endorsement Misconception Endorsement 

Figure 8: Galaxy statement sA218 characteristic curves for the first galaxy statement factor, including student responses from 
all semesters. For this and succeeding item characteristic curve plots, labels 1, 2, and 3 corresponding to black, blue, and purple 
curves respectively mean "unlearned prior to AST 109," "dispelled in AST 109," and "retained through AST 109" 

On all of these graphs, the latent trait "misconception endorsement" represents the tendency for 
students to endorse any particular galaxy misconception. Left to right indicates a higher tendency to retain 
galaxy misconceptions. We note from the figure, however, that most students, including those who are high 
on endorsing misconceptions, responded with either "1" (dispelled the misconception prior to AST 109) or 
"2" (dispelled the misconception in AST 109) for a broad range of misconception endorsement. So, even those 
students who are easily persuaded to endorse misconceptions dispelled this one, which indicates that "the 
Milky Way is the only galaxy" is an easy item, that is, one which is dispelled quite readily. For comparison, 
a hard item is one which is endorsed quite readily, and not easily dispelled. As we will demonstrate later, 
IRT graphs with mostly "1" or "2" responses represent easier items, while IRT graphs with mostly "2" or 
"3" responses represent harder items. 

We now discuss the results of performing IRT analysis on the first factor of galaxy statements: 
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sA218, sA219, sA220, sA225, sA228, sA230, and sA231. Figures [8pT] show the most interesting graphs 
generated by MULTILOG, which include the graphs for sA218, sA220, sA225, and sA228. The graphs for 



sA230 and sA231 look like those for sA225, and the graphs for sA219 look like those for sA220. 




MisconceptionEndorsement Misconception Endorsement 

Figure 9: Galaxy statement sA220 characteristic curves for the first galaxy statement factor, including student responses from 
all semesters 




MisconceptionEndorsement MisconceptionEndorsement 

Figure 10: Galaxy statement sA225 characteristic curves for the first galaxy statement factor, including student responses from 
all semesters 



We are now ready to determine the most logical order to teach galaxy topics in the first galaxy 
factor. The values in Table [5] are those identified with the "misconception endorsement" coordinate for each 
left-side graph. The left CRC intersections are identified with the coordinate in which the "1" and "2" 
CRCs intersect. Each intersection marks the latent trait in which children and adolescents are most likely 
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Misconception Endorsement Misconception Endorsement 

Figure 11: Galaxy statement sA228 characteristic curves for the first galaxy statement factor, including student responses from 
all semesters 



to transition over from dispelling the misconception to retaining it through adolescence. Similarly, each 
right intersection coordinate, representing where the "2" and "3" CRCs intersect, marks the latent trait in 
which students in AST 109 transition from dispelling the misconception in AST 109 to still retaining it even 
after instruction. Hence, we consider two audiences separately in our analysis: the left intersection scores 
represent transitions for children and adolescents, whereas the right intersection scores represent transitions 
for students in AST 109. We list the CRC intersection coordinate and the discrimination parameter cij 
(Section pT2| in Table [5] which we then use to suggest a sequence for the statements in order of progressively 
higher difficulty As we will argue, the sequence for the group of adults is different than the sequence for the 
group of children and adolescents. 

Table 5: Location of Galaxy Statement Factor 1 CRC intersections. For each column, an item with a lower coordinate is harder 
than items with higher intersection coordinates, indicating that it is particularly likely to be an easily-withheld misconception 



Statement aj Left CRC intersection Right CRC intersection 

sA218 1754 (L28 2M 

sA219 1.10 -0.92 2.76 

sA220 1.27 -1.32 2.61 

sA225 1.83 -0.34 2.21 

sA228 1.74 -0.76 2.63 

sA230 1.78 -0.77 2.50 

sA231 1.53 -0.89 2.54 



We remind the reader that the meaning of the "misconception endorsement" is to refer to the 
latent trait of the participant. So, lower intersection coordinates correspond to harder items. For example, 
sA218, the misconception that the Milky Way is the only galaxy, is the easiest item in Factor 1 for both 
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audiences, while sA220, the misconception that all galaxies are spiral, seems to be a hard item for children 
and adolescents, but is a medium-difficulty item for AST 109 students. Also, the misconception that there 
are only a few galaxies (sA225) is the hardest item in the factor for AST 109 students but the second easiest 
item for children and adolescents. 

Using the locations of the right intersections, the logical order for teaching the first factor of galaxy 
topics to adults is sA218, sA219, sA228, sA220, sA231, sA230, sA225. We note, however, that the relative 
locations of the left peaks, corresponding to disambiguation during childhood or adolescence vs. disam- 
biguation in AST 109, do not necessarily suggest that the logical order that applies for adults also applies to 
children and adolescents. Noting the positions of the left peaks from Table [5j we conclude that the logical 
order to teach the first factor of galaxy topics to children and adolescents is sA218, sA225, sA228, sA230, 
sA231, sA219, sA220. 

We now discuss the results of performing IRT analysis on the second factor of galaxy statements: 
sA221, sA222, and sA224. Of these statements, we show the MULTILOG graph for the most informative 
item in this factor, sA224, in Figure [12| The MULTILOG graphs of sA221 and sA222 are similar. All three 
statements relate to the idea of centralization, the misconceptions that we are at the center of the Milky 
Way galaxy or the universe. 




Misconception Endorsement Misconception Endorsement 

Figure 12: Galaxy statement sA224 characteristic curves for the second galaxy statement factor, including student responses 
from all semesters 



As before, we can use the CRC intersection coordinates (Table K3j) to suggest a logical order to teach 
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galaxy topics for the two audiences. Using IRT, our results suggest that sA224, the misconception that 
the Sun is at the center of the universe, should be taught first to both audiences. Thereafter, the optimal 
order for children and adolescents is to teach sA222 then sA221, whereas the optimal order for adults is 
the reverse. Interestingly, the closeness of the sA221 vs. sA222 left CRC intersections suggests that sA221 
and sA222 may be taught together. Using IRT, the most logical sequence is to show first that the Sun is 
not at the center of the universe, then show that the Milky Way is not at the center of the universe and 
that the Sun is not at the center of the Milky Way galaxy. Our findings thus suggest that the conceptual 
framework projected by adolescents about "centralization" does not change as adolescents transcend into 
adulthood. 



Table 6: Location of Galaxy Statement Factor 2 CRC intersections 



Statement 


a j 


Left CRC intersection 


Right CRC intersection 


sA221 


2.15 


-0.44 


1.58 


sA222 


2.19 


-0.40 


1.45 


sA224 


2.25 


-0.11 


1.65 



We now discuss the results of performing IRT analysis on the third and final factor of galaxy state- 



ments: sA226 and sA227, shown in Figures 13 and 14 The MULTILOG graph of sA226, the misconception 



that the galaxies are randomly distributed, is shown in Figure |13| We observe that this item represents 
the hardest of all the galaxy statements, because even those students who are generally low on endorsing 
misconceptions are likely to respond with "2" or "3," indicating high retention of the misconception. This 
is exactly the opposite of what we observe for sA218 (Figure [8]), which represents the easiest galaxy item. 
For sA226, we observe an information "plateau" for the 9 range -2 to 0, which is similar to the plateau 
constructed in Figure [5] The plateau thus suggests a mixing of the two transitions: adolescent retention 
vs. disambiguation and college student retention vs. disambiguation. We do not observe this for any other 
galaxy statement. Wc also include the MULTILOG graph for sA227, the misconception that we can see all 
the stars in the Milky Way, in Figure [14] 

As before, we can use the CRC intersection coordinates (Table [7]) to construct the most logical 
order to teach galaxy topics for both audiences. It turns out that the logical third galaxy factor order for 
adults is the same for children and adolescents: sA227, sA226. Our findings thus suggest that the conceptual 
framework projected by adolescents about spatial distributions of galaxies and the stars within them does 
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-1 1 

Misconception Endorsement 




-1 1 

Misconception E ndor setn ent 



Figure 13: Galaxy statement sA226 characteristic curves for the third galaxy statement factor, including student responses 
from all semesters 



not change as adolescents transcend into adulthood. 

Table 7: Location of Galaxy Statement Factor 3 CRC intersections 



Statement 


a j 


Left CRC intersection 


Right CRC intersection 


sA226 


1.24 


-1.88 


-0.05 


sA227 


1.09 


-0.78 


1.95 



Having sequenced the three galaxy factors individually, our final task is to sequence the factors 
themselves as a function of the audience. We calculate a left and right "misconception factor coordinate" 
(MFC) by choosing a "representative" weight for each statement in the factor, then computing the weighted 
mean of the intersection scores for all statements in each factor separately for the left and right intersections. 
In Section [3. 4| we noted that statements that are the most informative also have the highest discrimination 
parameters aj. Therefore, we choose to weight the statements by their respective discriminatory parameter. 
For left intersections, each with coordinate L, the left MFC for a given factor is given by 



Left MFC 



E a 3 L 3 

i_ 



(15) 



Likewise, for right intersections, each with coordinate R, the left MFC for a given factor is given by 



Right MFC 



J2 a j R j 
J_ 

3 



(16) 
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Misconception Endorsement Misconception Endorsement 

Figure 14: Galaxy statement sA227 characteristic curves for the third galaxy statement factor, including student responses 
from all semesters 

The most logical order to sequence the factors is in order from highest to lowest MFC, as lower scores in 
each column represent harder items, which thus reflect harder galaxy topics. We summarize our MFCs in 
Tabled 

Table 8: Misconception Factor Coordinate for the Three Galaxy Factors. Lower coordinates in each column represent harder 
groups of galaxy topics to teach 



Factor 


Left MFC 


Right MFC 


1 


-0.64 


2.58 


2 


-0.32 


1.56 


3 


-1.36 


0.88 



Based on our MFCs, the most logical sequence for adults is in the order of factors 1, 2, 3, whereas 
the most logical sequence for children and adolescents is 2, 1, 3. Our findings thus suggest that of the three 
factors (regarding visual galaxy properties, centralization, and spatial distribution), adolescents are most 
likely to refine their understanding of concepts related to visual galaxy properties as they enter adulthood. 



Our results for all galaxy factors and sequences for both age groups are summarized in Table [9] 

Table 9: Logical Sequence to Teach Galaxy Topics 



Age Group 


Factor 


Order of Statements 


Adults 


1 


sA218, sA219, sA228, sA220, sA231, sA230, sA225 




2 


sA224, sA221, sA222 




3 


sA227, sA226 


Children and Adolescents 


2 


sA224, sA222, sA221 




1 


sA218, sA225, sA228, sA230, sA231, sA219, sA220 




3 


sA227, sA226 
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6 SUMMARY AND CONCLUSIONS 

In this paper, we have summarized the mathematics underlying item response theory (IRT). We 
then applied item response theory to data collected from a study of astronomy misconceptions administered 
to college astronomy lecture students over three semesters. We then considered the special case of galaxies 
from this data. We used principal components analysis to determine galaxy items that clustered together 
based on correlated student responses. We then used IRT to scale the items for difficulty represented by the 
factors of items that we identified from factor analysis. Our work indicates that the order of difficulty in 
unlearning various incorrect beliefs about galaxies is a function of student age. Our results suggest that it 
would seem logical to allocate more classroom time in the course to those topics reflecting the most persistent 
misconceptions. 

We now summarize the procedure that we used to determine the most logical order to teach galaxy 
concepts: 

1. Recode student responses to the AST 109 Misconceptions Inventory (Appendix [B]) into an order of 
increased retention of misconceptions, termed "misconception scores." 

2. Perform principal components analysis on the misconception scores to extract the appropriate galaxy 
topic groups, each with well-correlated responses. 

3. Perform item response theory analysis using the graded response model on each factor of statements, 
one statement at a time, to generate an item characteristic curve for each score along an axis of 
increasing "misconception persistence." 

4. Use the locations of the category response curve intersections to sequence groups of items by order of 
highest misconception persistence. Since each intersection corresponds to a transition from one "age" 
of dispelling misconceptions to another age of dispelling misconceptions, these intersections represent 
the relative ages of the study participants. 

5. For each factor, sequence the items by the locations of their intersections. 
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6. Sequence the factors by a weighted average of the intersections for each age. 

This procedure leads to the prediction of the best order in which to present material to groups of different 
ages. Using this procedure, we find that the easiest misconception to dispel is that there is only one galaxy. 
We find that the hardest thing for all audiences to learn is that the galaxies are not randomly distributed. We 
also note that this misconception is strongly correlated with the misconception that we can see all the stars 
in the Milky Way, consistent with one's framework of "spatial distribution." We also suggest that students 
who endorse misconceptions about us being in the Milky Way's center or the Milky Way being at the center 
of the universe also endorse the misconception that there are no more than a few galaxies. 

We continue to conduct additional tests on student data for other sections (e.g. black holes, planets) 
of the AMI based on the procedure used for this paper. These results will be reported in subsequent 
papers. 

The work in this paper represents theory that makes predictions that we are also planning on 
testing through experiments involving teaching innovations with the cooperation of Prof. David Batuski at 
the University of Maine. If anyone else is interested in participating in the experimental part of this research, 
they are welcome to contact us. 

The authors gratefully acknowledge the assistance of Steve Balsis, Ph.D. (Psychology Department, Texas A & M 
University) with MULTILOG command file syntax. 
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Acronyms 



1PL: one-parameter logistic 



2PL: two-parameter logistic 



AMI: Astronomy Misconceptions Inventory 



CRC: character response curve 



CTT: classical test theory 



GRM: graded response model 



ICC: item characteristic curve 



IRT: item response theory 



MFC: misconception factor coordinate 



PCA: principal components analysis 
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B AST 109 Misconceptions Inventory NAME: 

Instructions for this Inventory: 

What this is all about: The reason this inventory is worth 5 points on your final grade is that it 
requires a fair amount of effort on your part, and two hours of your time. Your contribution will enable Dr. 
Comins and many other astronomy teachers to present the material in this course more effectively. Please 
do the following honestly and as completely as possible. Please work alone. 

Effects and risks of this research on you and others: Your responses will have no negative 
effect on your grade (and you will get the extra credit just for completing the inventory). While your only 
benefit in participating in this study is receiving extra credit, this research will help us better understand 
when people learn the correct science about astronomy. Data from this inventory will be used for research 
purposes in addition to improving the course. The only risks associated with participation in the study are 
your time and possible inconvenience during the inventory. 

Your options: Participation is voluntary. As a research participant you are free to skip any item 
that you do not wish to answer, and you may stop at any time. Once the assignment is handed in and extra 
credit points are given, your name will be removed and not associated with your responses. If you decide 
not to respond to the inventory, you will not obtain extra credit, of course, but otherwise your grade will be 
unaffected. As an alternative to earning extra credit by responding to this inventory, you are free to obtain 
equal extra credit by writing a paper on an astronomy topic that you and Dr. Comins agree upon. 

Security of this data: The data collected will be kept indefinitely in Dr. Comins locked office 
or similarly secure room. If you have any questions about the research project please contact Dr. Comins 
at 314 Bennett Hall or on First Class (Neil.Comins@umit.maine.edu). If you have any questions about 
your rights as a research participant, please contact Gayle Jones, Assistant to the University of Maine's 
Protection of Human Subjects Review Board, at 114 Alumni Hall (phone 581-1498), or on First Class 
(gayle.jones@umit.maine.edu) . 

A) After the number for each statement please write: 

A if you believed it only as a child 

B if you believed it through high school 

C if you believe it now 

D if you believed it, but learned otherwise in AST 109 

If you never thought about a certain statement, please consider it now. 

Write E if the statement sounds plausible or correct to you. 

Write F if you never thought about it before, but think it is wrong now. 

B) If you believe a statement is wrong, please briefly correct it in the space below. 
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Stars: 



sAl. 


all of the stars were created at the same time 


sA2. 


there are 12 zodiac constellations 


sA3. 


all of the stars are about as far away from the Earth as the Moon 


sA4. 


all stars are white 


sA5. 


the constellations are only the stars we connect to make patterns 


sA6. 


we are looking at stars as they are now 


sA7. 


stars actually twinkle — change brightness 


sA8. 


the north star is the brightest star in the sky 


sA9. 


stars have spikes sticking out of them 


sAlO. 


all stars have planets 


sAll. 


stars last forever 


sA12. 


the brighter a star is, the hotter it is 


sA13. 


all stars are evenly distributed on the celestial sphere 


sA14. 


all stars are the same distance from the Earth 


sA15. 


all stars have same color and size 


sA16. 


pulsars are pulsating stars 


sA17. 


all stars are smaller than the Sun 


sA18. 


the galaxy, solar system and universe are the same things 


sA19. 


stars create matter from nothing 


sA20. 


stars just exist — they don't make energy or change size or color 


sA21. 


all stars end up as white dwarves 


sA22. 


all stars are stationary — fixed on the celestial sphere 


sA23. 


stars emit only one color of light 


sA24. 


stars are closer to us than the Sun 


sA25. 


there are exactly 12 constellations 


sA26. 


the closer a star is to the Earth, the brighter it must be 


sA27. 


all the stars in an asterism move together 


sA28. 


a nova is the most powerful explosion 
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sA29. stars in the Milky Way are as close to each other as planets are to the Sun 

sA30. stars run on fuel: gasoline or natural gas 

sA31. "metals" have always existed in the universe 

sA32. stars follow you in your car 

sA33. we see the same constellations at night throughout the year 

sA34. stars are fixed in space 

sA35. stars in a binary system (two stars bound together by their gravity) would quickly collide 

sA36. we see all the stars in the Milky Way galaxy 

sA37. all stars are isolated from all other stars (none are binary) 

sA38. binary stars are physically very close to each other 

sA39. Polaris will always be in our north 

Solar System, Misc.: 

sA40. the asteroid belt is an area like we see in star wars, very densely packed 

sA41. Mercury is so named because there is much mercury on it 

sA42. comet tails are burning — because the comet is moving so fast 

sA43. there is plant life on other planets in our solar system 

sA44. Pluto is always farther from the Sun than is Neptune 

sA45. a shooting star is actually a star whizzing across the universe or falling through the sky 

sA46. Jovian planets (Jupiter, Saturn, Uranus, Neptune) have solid surfaces 

sA47. the asteroid belt is between Earth and Mars 

sA48. the Solar System is the whole universe or the whole galaxy 

sA49. Jupiter is almost large and massive enough to be a star 

sA50. all orbits around Sun are circular 

sA51. planets revolve around the Earth 

sA52. all planets orbit exactly in the plane of the ecliptic 

sA53. Pluto is a large, jovian (Jupiter-like) planet 

sA54. all constellations look like things they are named for 
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sA55. Titan's atmosphere could sustain life 

sA56. comets last forever 

sA57. each planet has one moon 

sA58. Mercury (closest planet to the Sun) is hot everywhere on its surface 

sA59. the day on each planet is 24 hours long 

sA60. all stars have prograde rotation (spin same way as the Earth) 

sA61. comets are pulled out from Jupiter 

sA62. there are no differences between meteors, meteorites, meteoroids 

sA63. asteroids, meteoroids, comets are same 

sA64. Jupiter's great red spot is a(n) . (please fill in) 

sA65. all planets have the same orbital inclinations (tilt of axis with respect to ecliptic) 

sA66. optical telescopes are the only "eyes" astronomers have on the universe 

sA67. humans have never landed a spacecraft on another planet 

sA68. we do not have telescopes in space 

sA69. all the planets in our solar system have been known for hundreds of years 

sA70. comets are molten rock hurtling through space at high speeds and their tails are jet wash behind 
them 

sA71. comets only occur once every decade or so 

sA72. there are many galaxies in a solar system 

sA73. planets twinkle in the night sky 

sA74. comet tails always trail behind them 

sA75. comets are solid, rocky debris 

sA76. Jupiter's great red spot is a volcano erupting 

Moon : 

sA77. there is only one moon — ours 

sA78. the Moon doesn't cause part of the tides 

sA79. we see all sides of the Moon each month 

sA80. craters are volcanic in origin 
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sA81. when the Moon is in the first quarter, we see a quarter of the Moon 

sA82. the Moon gives off its own energy 

sA83. the Moon is at a fixed distance from Earth 

sA84. the Moon changes physical shape throughout its cycle of phases 

sA85. the Moon doesn't rotate since we see only one side of it 

sA86. the Moon is made of sand 

sA87. the Moon has seas and oceans of water 

sA88. the Moon is older than the Earth: a dead planet that used to be like Earth 

sA89. the Moon is about the same temperature as the Earth 

sA90. the Moon has a helium atmosphere 

sA91. the Moon has an atmosphere like the Earth 

sA92. the Moon has a smooth surface 

sA93. the Moon sets during daylight hours and is not visible then 

sA94. there is a real man in the Moon 

sA95. the Moon is densest at its geometric center 

sA96. because the Moon reflects sunlight, it has a mirror-like surface 

sA97. the Moon will someday crash into Earth 

sA98. the Moon is a captured asteroid 

sA99. a lunar month is exactly 28 days long 

sAlOO. at new Moon we are seeing the "far" side of the Moon 

sAlOl. a flag left by astronauts is visible from Earth with the naked eye 

sA102. the Moon follows you in your car 

sA103. the Moon is larger at the horizon than when it is overhead 

sA104. the side of the moon we don't see is forever "dark" 

sA105. the moon is lit by reflected "Earth light" (that is, sunlight scattered off the Earth toward the 
Moon) 

Venus : 

sA106. life as we know it can exist on Venus 
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sA107. clouds on Venus are composed of water, like clouds on earth 

sA108. Venus is very different from earth in size 

sA109. Venus is a lot like the earth in temperature 

sAHO. Venus is always the first star out at night 

Earth : 

sAlll. Earth's axis is not tilted compared to the ecliptic 

sA112. summer is warmer because we are closer to the sun during the summertime 

sA113. once ozone is gone from the Earth's atmosphere, it will not be replaced 

sA114. Earth and Venus have similar atmospheres 

sA115. Earth is at the center of the universe 

sA116. Earth is the biggest planet 

sA117. the Earth has always existed 

sA118. Spring Tide is in the spring 

sA119. there is no precession (change in where the Earth's north pole points in space) 

sA120. solar eclipses are over in a few seconds 

sA121. comets never collide with Earth 

sA122. X-rays can reach the ground 

sA123. all radiation from space can reach the ground 

sA124. Earth and Mars have similar atmospheres 

sA125. meteoroids Enter the atmosphere a few times a night 

sA126. you can see a solar eclipse from anywhere on Earth that happens to be facing the Sun at that 
time 

sA127. auroras are caused by sunlight reflecting off polar caps 

sA128. the Moon is not involved with any eclipses 

sA129. the day has always been 24 hours long 

sA130. the air is a blue gas 

sA131. Halley's comet will eventually hit Earth 

sA132. the Earth came from another planet that exploded 
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sA133. the sun orbits the Earth 

sA134. the Earth is the largest planet 

sA135. solar eclipses happen about once a century and are seen everywhere on Earth 

sA136. Earth's core is completely solid 

sA137. only Earth among the planets and moons has gravity 

sA138. Earth is at the center of the solar system 

sA139. the ozone layer gives us acid rain 

sA140. all space debris reaches the ground intact 

sA141. seasons were chosen haphazardly 

sA142. meteorites have stopped falling onto the Earth 

sA143. the Earth will last forever 

sA144. the Earth's magnetic poles go through its rotation poles 

sA145. planes can fly in space 

sA146. a day is exactly 24 hours long 

sA147. a year is exactly 365 days long 

sA148. seasons are caused by speeding up and slowing down of Earth's rotation 

sA149. the Earth orbits the sun at a constant speed 

sA150. the Earth is in the middle of the Milky Way galaxy 

sA151. the sky is blue because it reflects sunlight off oceans and lakes 

sA152. the Earth is the only planet with an atmosphere 

sA153. comets affect the weather 

sA154. the Earth is not changing internally 

sA155. sunset colors are caused by clouds 

sA156. the tides are caused just by the Earth's rotation 

sA157. Earth has a second moon that only comes around once in awhile — "once in a blue moon" 

sA158. the Sun is directly overhead everywhere on Earth at noon 

sA159. tides are caused just by ocean winds 

sA160. the Earth is flat 
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Mars : 

sA161. Mars is green (from plant life) 

sA162. all erosion on Mars is due to water flow 

sA163. Mars has life on it now 

sA164. Mars has running water now 

sA165. Mars could be made inhabitable 

sA166. Mars is the second largest planet 

sA167. life, when it did exist on Mars, was quite advanced 

sA168. there are Lowellian canals on Mars built by intelligent beings 

sA169. Mars is Hot because it is red . . . Mars — god of fire 

sA170. Mars is the sister planet to earth in physical properties and dimensions 

Saturn : 

sA171. Saturn is the only planet with rings 

sA172. Saturn's rings are solid 

sA173. a nebula is a shiny ring around Saturn 

sA174. Saturn's rings are caused by the planet spinning so fast 

sA175. Saturn's rings are made of gas 

sA176. Saturn has only one ring 

Sun : 

sA177. the Sun is a specific type of astronomical body with its own properties. It is not a star 

sA178. the Sun will burn forever 

sA179. the solar wind is a strong wind that blows all over the universe 

sA180. the Sun is the hottest thing in the galaxy 

sA181. the Sun does not move through space 

sA182. the Sun does not cause part of the tides 

sA183. sunspots are hot spots on the Sun's surface 
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sA184. the Sun will blow up, become a black hole, and swallow the earth 

sA185. the Sunspot cycle is 11 years long 

sA186. the Sun's surface temperature is millions of degrees Fahrenheit 

sA187. Sunspots are constant fixtures on the sun 

sA188. the Sun is yellow 

sA189. the Sun is the brightest star in universe 

sA190. the Sun is the brightest object in the universe 

sA191. the Sun always sets due west 

sA192. the Sun is made of fire 

sA193. the Sun is a "heat" planet 

sA194. the Sun is a bunch of stars clumped together 

sA195. the Sun is getting smaller 

sA196. the Sun is the smallest star in the universe 

sA197. the Sun has no atmosphere 

sA198. the Sun is the largest star 

sA199. the Sun is hottest on its surface 

sA200. the Sun has a solid core 

sA201. the Sun has only a few percent of the mass in the solar system 

sA202. the Sun is mostly iron 

sA203. the Sun is the closest planet to the earth 

sA204. the Sun's surface is perfectly uniform 

sA205. Sunspots indicate that the Sun is getting cooler each year and will eventually become too cool to 
support life 

sA206. the entire Sun is molten lava 

sA207. the solar wind only occurs occasionally 

sA208. the Sun will explode as a nova 

sA209. the Sun is hottest star 

sA210. you can look at the Sun and not go blind 

sA211. it is possible that the Sun could explode in the "near future" 
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sA212. after the sun's core becomes helium, the sun will immediately become a white dwarf 
sA213. the Sun doesn't rotate 

sA214. the Sun is the only source of light in the galaxy — Sunlight reflects off planets and stars so we 
can see them. 

sA215. Sunspots are where meteors crash into the Sun 

sA216. solar wind is a dense wind like winds we experience here on earth 

sA217. it is more dangerous to look at the Sun during an eclipse because the radiation level from sun is 
greater then, than when there is no eclipse 

Galaxies : 

sA218. the Milky Way is the only galaxy 

sA219. the solar system is not in the Milky Way (or any other) galaxy 

sA220. all galaxies are spiral 

sA221. the Milky Way is the center of the universe 

sA222. the Sun is at the center of the Milky Way galaxy 

sA223. the Milky Way galaxy consists of just our solar system 

sA224. the Sun is at the center of the universe 

sA225. there are only a few galaxies 

sA226. the galaxies are randomly distributed 

sA227. we see all the stars that are in the Milky Way 

sA228. all galaxies are the same in size and shape 

sA229. asteroid belts exist throughout our galaxy 

sA230. the Milky Way is just stars — no gas and dust 

sA231. new planets and stars don't form today 

Black Holes : 

sA232. black holes create themselves from nothing 
sA233. black holes last forever 
sA234. black holes really don't exist 
sA235. black holes are empty space 
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sA236. we can't see a black hole 

sA237. black holes do not have mass 

sA238. black holes are like huge vacuum cleaners, sucking things in 

sA239. black holes are bottomless pits 

sA240. black holes are doors to other dimensions 

sA241. black holes are black 

sA242. black holes can be seen visually, like seeing a star or planet 

sA243. we could live in a voyage through a black hole 

sA244. we could travel through time in a black hole 

sA245. black holes get bigger forever and nothing can stop them from doing so 

sA246. black holes are actual holes in space 

sA247. a single black hole will eventually suck in all the matter in the universe 

General Astrophysics: 

sA248. cosmic rays are light rays 

sA249. the universe has always existed 

sA250. there are other planets in our Milky Way galaxy on which advanced life exists 

sA251. Earth has been visited by aliens 

sA252. astronomy and astrology are the same thing 

sA253. gravity will eventually pull all the planets together 

sA254. satellites need continuous rocket power to stay in orbit around the Earth 

sA255. light travels infinitely fast 

sA256. space is infinite 

sA257. space is finite 

sA258. telescopes cannot see any details on any of the planets 

sA259. gravity is the strongest force in the universe 

sA260. color and temperature of a star are unrelated 

sA261. we can hear sound in space 
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sA262. the universe as a whole is static (unchanging) 

sA263. astronomical ideas of mass, distance, and temperature of planets are all speculative 

sA264. changes in stars, planets, and galaxies occur in human lifetimes 

sA265. there is air in space 

sA266. universe is filled with stars 

sA267. there is a center to the universe 

sA268. ice doesn't exist in space 

sA269. everything in the universe is accelerating away from everything else 

sA270. smaller telescopes enable astronomers to see smaller details 

sA271. the most important function of a telescope is magnification 

sA272. all space debris existing today is the result of planet collisions and explosions on planets 

sA273. astronomers mostly work with telescopes 

sA274. it is possible to travel faster than the speed of light (that is, at "warp speed") 
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